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IN THE CLAIMS 

Please amend/retain/add the claims as follows ^. ' 

1 . (Currently Amended) An automated method of designating text, taken from a set of 
citing documents, as reasons for citing (RFC) a cited document that are associated with 
respective citing instances of a citing document, the method comprising: 

obtaining contexts of the citing instances in the respective citing documents, each 
context including a text unit that includes the citing instance and a text unit that is near the citing 
instance; 

analyzing the content of the contexts , said step of analyzing including calculating 
a content score for each text unit based on text unit content words that are common to at least 
two of the citing documents' contexts or to at least one citing document's context and said cited 
document ; and 

selecting, from the citing instances' context, at least one text unit that constitutes 
the RFC, based on the analyzed content of the contexts. 

2. (Currently Amended) An automated method of designating text, taken from a set of 
citing documents, as reasons for citing (RFC) a cited document, said RFC being associated with 
respective citing instances of a citing document, the method comprising: 

inputting text from the citing documents; 

dividing the citing documents' text to define paragraphs, and dividing the 
paragraphs to define sentences; 
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obtaining contexts of the citing instances in the respective citing documents, each 
context including: a sentence that includes the citing instance and at least one sentence that is 
near the citing instance; 

generating a content word list based on containing content words that are in at 
least two of the citing documents' contexts or that are in at least one citing document's context 
and said cited document : 

calculating, for the sentences in the citing documents' contexts, respective content 
scores that are based on frequency counts of the content words that are recited in the respective 
sentences; and 

selecting, from the citing documents' contexts, the sentences at least one sentence 
that constitutes the RFC, based on the calculated content scores. 

3. (Currently Amended) The method of claim [[2]] I, wherein the step of analyzing the 
content content wo r d gen er ating step includes: 

generating the a content word list based on the content words that are included in 
the contexts of at least two of the citing documents , and assigning each of said content words a 
frequency count which is used in calculating the content score . 

4. (Currently Amended) The method of claim [[2]] 1, wherein the step of analyzing the 
content cont e nt wo r d gene r ating step includes: 
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generating the a content word list based on the content words that are included 
both in the cited document itself and in the context of at least one citing document , and assigning 
each of said content words a frequency count which is used in calculating the content score . 

Claims 5-15 (Canceled). 

16. (Currently Amended) An apparatus for designating text, taken from a set of citing 
documents, as reasons for citing (RFC) a cited document that are associated with respective 
citing instances of a citing document, the apparatus comprising: 

means for obtaining contexts of the citing instances in the respective citing 
documents, each context including a text unit that includes the citing instance and a text unit that 
is near the citing instance; 

means for analyzing the content of the contexts , said means for analyzing 
including means for calculating a content score for each text unit based on text unit content 
words that are common to at least two of the citing documents' contexts or to at least one citing 
document's context and said cited document ; and 

means for selecting, from the citing instances' context, at least one text unit that 
constitutes the RFC, based on the analyzed content of the contexts. 



4 




Serial No.: 09/468,785 

Atty. Docket No.: P64616US0 

17. (Currently Amended) An apparatus for designating text, taken from a set of citing 
documents, as reasons for citing (RFC) a cited document, said RFC being associated with 
respective citing instances of a citing document, the apparatus comprising: 

means for dividing the citing documents' text to define paragraphs, and for 
dividing the paragraphs to define sentences; 

means for obtaining contexts of the citing instances in the respective citing 
documents, each context including: a sentence that includes the citing instance and at least one 
sentence that is near the citing instance; 

means for generating a content word list bas e d on containing content words that 
are in at least two of the citing documents' contexts or that are in at least one citing document's 
context and said cited document ; 

means for calculating, for the sentences in the citing documents' contexts, 
respective content scores that are based on frequency counts of the content words that are recited 
in the respective sentences; and 

means for selecting, from the citing documents' contexts, the sent e nces at least 
one sentence that constitutes the RFC, based on the calculated content scores. 

1 8. (Currently Amended) The apparatus of claim [[17]] 16, wherein the content wo r d 
gene r ating means for analyzing the content includes: 



5 




Serial No.: 09/468,785 

Atty. Docket No.: P64616US0 

means for generating the a content word list based on the content words that are 
included in the contexts of at least two of the citing documents , and for assigning each of said 
content words a frequency count which is used in calculating the content score . 

19. (Currently Amended) The apparatus of claim [[17]] 16, wherein the c o ntent wo r d 
g e nerating means for analyzing the content includes: 

means for generating the a content word list based on the content words that are 
included both in the cited document itself and in the context of at least one citing document , and 
assigning each of said content words a frequency count which is used in calculating the content 
score. 



Claims 20-30 (Canceled). 



3 1 . (Currently Amended) A computer-readable memory that, when used in conjunction 
with a computer, can carry out a method of designating text, taken from a set of citing 
documents, as reasons for citing (RFC) a cited document that are associated with respective 
citing instances of a citing document, the computer-readable memory comprising: 

computer-readable code for obtaining contexts of the citing instances in the 
respective citing documents, each context including a text unit that includes the citing instance 
and a text unit that is near the citing instance; 
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computer-readable code for analyzing the content of the contexts including 
calculating a content score for each text unit based on text unit content words that are common to 
at least two of the citing documents' contexts or to at least one citing document's context and 
said cited document ; and 

computer-readable code for selecting, from the citing instances' context, at least 
one text unit that constitutes the RFC, based on the analyzed content of the contexts. 

32. (Currently Amended) A computer-readable memory that, when used in conjunction 
with a computer, can carry out a method of designating text, taken from a set of citing 
documents, as reasons for citing (RFC) a cited document, said RFC being associated with 
respective citing instances of a citing document, the apparatus comprising: 

computer-readable code for inputting text from the citing documents; 
computer-readable code for dividing the citing documents' text to define 
paragraphs, and dividing the paragraphs to define sentences; 

computer-readable code for obtaining contexts of the citing instances in the 
respective citing documents, each context including: a sentence that includes the citing instance 
and at least one sentence that is near the citing instance; 

computer-readable code for generating a content word list based on containing 
content words that are in at least two of the citing documents' contexts or that are in at least one 
citing document's context and said cited document ; 
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computer-readable code for calculating, for the sentences in the citing documents' 
contexts, respective content scores that are based on frequency counts of the content words that 
are recited in the respective sentences; and 

computer-readable code for selecting, from the citing documents' contexts, the 
s e nt e nc e s at least one sentence that constitutes the RFC, based on the calculated content scores. 

33. (Currently Amended) The computer-readable memory of claim [[32]] 31, wherein the 
cont e nt word generating computer-readable code for analyzing the content includes: 

computer-readable code for generating the a content word list based on the 
content words that are included in the contexts of at least two of the citing documents , and for 
assigning each of said content words a frequency count which is used in calculating the content 
score . 

34. (Currently Amended) The computer-readable memory of claim [[32]] 31, wherein the 
content word gene r ating computer-readable code for analyzing the content includes: 

computer-readable code for generating the a content word list based on the 
content words that are included both in the cited document itself and in the context of at least one 
citing document , and for assigning each of said content words a frequency count which is used in 
calculating the content score . 



Claims 35-45 (Canceled). 
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46. (New) The method of claim 2, wherein the step of generating a content word list 
includes the steps of: 

associating paragraphs from the documents; 

processing text in the associated paragraphs to eliminate noise words that convey 
little information about paragraph content; 

determining common words that are not eliminated by the processing step and that 
are found in at least two paragraphs; 

tallying frequency counts that indicate respective numbers of paragraphs within 
which the common words are encountered, said frequency counts indicating a degree of 
relevance for respective common words; and 

forming the content word list to include the common words linked to respective 
frequency counts. 

47. (New) The method of claim 46, wherein the step of determining includes stemming 
the common words of the associated paragraphs to a length that preserves their essential 
character while eliminating characters that convey little information about word identity. 

48. (New) The method of claim 2, wherein the step of calculating content scores 
includes the steps of: 

calculating respective initial content scores (ICS) for the sentences in the citing 
documents, based on the content words in the sentences; 
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calculating respective distances of the sentences in the citing documents from 
respective citing instances of the cited document; and 

calculating respective content scores (CS) for the sentences in the citing 
documents, based on at least the ICS and the distances. 

49. (New) The method of claim 48, wherein the step of calculating content scores 
further includes the step of normalizing the ICS to form normalized initial content scores (NICS) 
for use by the CS calculation step, said normalizing step taking into account numbers of words in 
the respective sentences and a largest frequency count in the content word list. 

50. (New) The method of claim 48, wherein the step of calculating content scores 
further includes the step of modifying the distances to form respective modified absolute 
distances for use by the the CS calculation step, said step of distance modification being based 
upon criteria relating to predetermined statistical observations of implications of placement of a 
sentence in the citing document relative to the citing instance, said criteria including whether a 
sentence is in a same paragraph with the citing instance or is located after the citing instance. 

5 1 . (New) The apparatus of claim 1 7, wherein the means for generating a content word 
list includes: 

means for associating paragraphs from the documents; 
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means for processing text in the associated paragraphs to eliminate noise words 
that convey little information about paragraph content; 

means for determining common words that are not eliminated by the processing 
step and that are found in at least two paragraphs; 

means for tallying frequency counts that indicate respective numbers of 
paragraphs within which the common words are encountered, said frequency counts indicating a 
degree of relevance for respective common words; and 

means for forming the content word list to include the common words linked to 
respective frequency counts. 

52. (New) The apparatus of claim 51, wherein the means for determining includes 
means for stemming the common words of the associated paragraphs to a length that preserves 
their essential character while eliminating characters that convey little information about word 
identity. 

53. (New) The apparatus of claim 17, wherein the means for calculating content scores 
includes: 

means for calculating respective initial content scores (ICS) for the sentences in 
the citing documents, based on the content words in the sentences; 

means for calculating respective distances of the sentences in the citing documents 
from respective citing instances of the cited document; and 
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means for calculating respective content scores (CS) for the sentences in the citing 
documents, based on at least the ICS and the distances. 

54. (New) The apparatus of claim 53, wherein the means for calculating content scores 
further includes means for normalizing the ICS to form normalized initial content scores (NICS) 
for use by the CS calculation step, said normalizing means taking into account numbers of words 
in the respective sentences and a largest frequency count in the content word list. 

55. (New) The apparatus of claim 53, wherein the means for calculating content scores 
further includes means for modifying the distances to form respective modified absolute 
distances for use by the the CS calculation step, said distance modification means using criteria 
relating to predetermined statistical observations of implications of placement of a sentence in 
the citing document relative to the citing instance, said criteria including whether a sentence is in 
a same paragraph with the citing instance or is located after the citing instance. 

56. (New) The computer-readable memory of claim 32, wherein the computer-readable 
code for generating a content word list includes: 

computer-readable code for associating paragraphs from the documents; 
computer-readable code for processing text in the associated paragraphs to 
eliminate noise words that convey little information about paragraph content; 
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computer-readable code for determining common words that are not eliminated by 
the processing step and that are found in at least two paragraphs; 

computer-readable code for tallying frequency counts that indicate respective 
numbers of paragraphs within which the common words are encountered, said frequency counts 
indicating a degree of relevance for respective common words; and 

computer-readable code for forming the content word list to include the common 
words linked to respective frequency counts. 

57. (New) The computer-readable memory of claim 56, wherein the computer-readable 
code for determining includes computer-readable code for stemming the common words of the 
associated paragraphs to a length that preserves their essential character while eliminating 
characters that convey little information about word identity. 

58. (New) The computer-readable memory of claim 32, wherein the computer-readable 
code for calculating content scores includes: 

computer-readable code for calculating respective initial content scores (ICS) for 
the sentences in the citing documents, based on the content words in the sentences; 

computer-readable code for calculating respective distances of the sentences in the 
citing documents from respective citing instances of the cited document; and 

computer-readable code for calculating respective content scores (CS) for the 
sentences in the citing documents, based on at least the ICS and the distances. 
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59. (New) The computer-readable memory of claim 58, wherein the computer-readable 
code for calculating content scores further includes computer-readable code for normalizing the 
ICS to form normalized initial content scores (NICS) for use by the CS calculation step, said 
normalizing computer-readable code taking into account numbers of words in the respective 
sentences and a largest frequency count in the content word list. 

60. (New) The computer-readable memory of claim 58, wherein the computer-readable 
code for calculating content scores further includes computer-readable code for modifying the 
distances to form respective modified absolute distances for use by the the CS calculation step, 
said distance modification computer-readable code using criteria relating to predetermined 
statistical observations of implications of placement of a sentence in the citing document relative 
to the citing instance, said criteria including whether a sentence is in a same paragraph with the 
citing instance or is located after the citing instance. 
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